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ABSTRACT 

The presence of positive serial correlation (autocorrelation) in 
remotely sensed data results in an underestimate of the variance* 
covariance matrix when calculated using contiguous pixels. This under- 
estimate produces an Inflation in F statistics, For a set of Thematic 
Mapper Simulator data (TMS), used to test the ability to discriminate a 
known geobotanicai anomaly from its background, the inflation in F 
statistics related to serial correlation is between 7 and 70 times. This 
means that significance tests of means of the spectral bands initially 
appear to suggest that the anomalous site is very different in spectral 
reflectance and emitance from its background sites. However, this 
difference often disappears and is always dramatically reduced when 
compared to frequency distributions of test statistics produced by the 
comparison of simulated training sets possessing equal means, but 
which are composed of autocorrelate d observations. 


THE INFLUENCE OF AUTOCORRELATION IN 
SIGNATURE EXTRACTION - AN EXAMPLE FROM 
A GEOBOTANICAL INVESTIGATION OF COTTER BASIN, MT 

INTRODUCTION 

Recently researchers have demonstrated that remotely sensed data from a number of sensor 
systems are autocorrelated. Such conclusions have been documented for Landsat, (Coberly, 1973; 
Craig, 1976, 1 979; Tubbs and Coberly, l978;Tubbs, I979;Cralg and Labovltz, 1980), Thematic 
Mapper Simulator (TMS) (Labovitz et al., 1981) and the Viking Lander (Craig et al., 198 1), Along 
with these findings, researchers have been studying what effects the violation of the assumption of 
independence has upon maximum likelihood, Bayesian classifiers and parameter estimation. This 
paper explores the influence of autocorrelation upon statistics in a hypothesis testing situation. 

BACKGROUND AND PREVIOUS RESEARCH 

Autocorrelation is a property defined for sequences, Xj, X^, * * *, X^_p Xj, Xj^j , * * Xn 

and is a measure of the randomness of the sequence. In a remote sensing context the sequence is 

a scan line and X. is the digital number (DN) for the ith pixel. The autocorrelation in a scan line 

is measured for specific distances between pixels. For example, the autocorrelation of adjacent 

pixels, known as the lag I autocorrelation, is estimuted by 

n-1 ( Xt - X)(Xt^, - X) 

1-1 S2(n~2) 

where: X is the sample mean of the scan line; 

is the sample variance of the scan line; 
n is the number of pixels in the scan line'. 

In general, the autocorrelation can be measured for lag k (k = 1, 2, , . n - 1), and is estimated 
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The function which maps a lag number to the degree of autocorrelation is known as the 
autocorrelation function (acQ. The acf can be thought of as taking a sequence, sliding It incre* 
mentally along itself and calculating a correlation coefficient at every increment. The autocor- 
relation is, therefore, proportional to paired deviations from the mean at a specific spacing, If the 
acf is not zero for all lags, then there is a nonrandom pattern In the deviations; and this pattern 
can be expressed by a function which relates the DN for the ith pixel to DN’s for the previous 
pixels, In this situation we speak of the system as having a memory and say that the data is 
autocorreiated. 

• 

Basu and Odell (1974) found that use of autocorreiated data produced biased estimates of 
the variance-covariance matrix. This conclusion was based upon the assumption of a equicorre- 
lation dependence structure, i.e. the correlation between Xj and Xj - p, for ail i # j. Basu et ai. 
(1976) showed that if the simple equicorre lation had a positive coefficient, i.e, p > 0, then the 
bias in estimation of the variance-covariance matrix, ?, resulted in confidence limits, about the 
mean, for example, which were too narrow. Intuitively, the assumption of equicorreiation is not very 
appealing because it implies that the correlation between pixels in a scene is the same no matter 
what the value of the spacing or distance between the pixels. However, the general direction of the 
bias appears to be correct because Cliff and Qrd (1981) have described a related bias in the distri- 
bution of the t-statistics (student’s t) when calculated on the estimated means from two popula- 
tions, Xj and ^ 2 , when Xj and X 2 are autocorreiated. Positive autocorrelation results in the 
underestimation of the standard error of the mean and consequent inflation of the t value. Gast- 
wirth and Rubin (1971) found the presence of even slight serial correlation (autocorrelation) had a 
strong influence on the confidence levels of the distributions of not only the mean but also the 
two one-sample nonparametric tests, the sign test and the Wllcoxon test. Tubbs and Coberly 
(1978) and Tubbs (1979) proposed an autoregression model of order 1 AR(1) as being more real- 
istic than equicorreiation for the dependence structure among the pixels from a Landsat scan line. 
The authors then went on to investigate the influence of the AR(1) structure upon estimation of fre- 
quency distribution parameters and the Bayesian and maximum likelihood classification procedures. 

2 
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The authors found that the estimate of the variance •covariance matrix wa* longer unbiased, 
that is the commonly used estimate, S, of J , resulted in an underestimate if .ulculated from adja- 
cent pix^-’*' >**hich were positively autocorrelated. For the Bayesian classification procedure, the 
authors showed that the misclassification probabilities were unaffected only when the costs of mls- 
classifying were equal and the prior probabilities were assumed to be equal, Further, the probs'^ 
billty of misclassification errors from a maximum likelihood classifier were appropriate only when 
the training set sizes were equal. Clearly, there are circumstances, including the need to obtain 
unbiased estimates of other parameters such as type I and type II errors which would result in 
experimental situations where these requirements are not met. Tubbs (1980) extended the results 
of Basu et al. (1976) and examined the Infiuence of an AR(1) dependence structure (Instead of 
eqificorrelation) upon the confidence regions of the mean and variance. The author found that 
when the observations possewed positive serial correlation, the assumption of stochastic inde- 
pendence resulted In confidence intervals that are voo restrictive, This meant that a 100 (1 - a) 
percent confidence region calculated under the assumption of random samples was <100 (I -Oi) 
percent if the data were positively serially correlated. 

Craig (1976, 1979) demonstrated for Landsat data that an autoregressive moving average of 
order one-one, an ARIMA (i, 0, 1) (Box and Jenkins, 1970) was a more appropriate model than 
the AR (I) model. O^e will discuss the ARIMA (1,0, 1) model in greater detail later.) Further, 
Craig (1979) suggested that a sampling scheme with a minimum of 10 pixels between samples be 
employed in order to produce independent observations. Craig and Labovjtz (1980) suggested 
that the ARIMA (1,0, 1) was a general model for Landsat data and that certain physical u^tributes 
of the scene were related to the degree of autocorrelation through systematic changes in the values 
of the coefficients of the ARIMA (1,0, 1). 

Craig (1981) showed that autocorrelated data, in comparison to independent observations 
tended to show smaller varmbllity for short distances (contiguous pixels) and larger variation for 
longer distances. He ^ilso demonstrated that the Inflation in the vsxiation behaved as a function of 
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the coefficients in the ARIMA (1,0, 1) modei. Craig suggested three methods for compensating for 
autocorrelation in data - 1) filtering out the ARIMA (1, 0, 1) model; 2) resampling the data with 
an appropriate spacing between pixels; and 3) adjusting the classification algorithms. 

Campbell (1981) used a sampling scheme to adjust his classification for the presence of auto- 
correlation. In this paper the author demonstrated effects of autocorrelation by comparing classifi- 
cation errors between classifications based on contiguous pixels and classifications based on training 
sets composed of randomly selected pixels. The data used was Landsat MSS data from near Rich- 
mond, VA. The results showed an almost universal improvement in classification accuracies under 
the random sampling alternative, This result held for data acquired during several portions of the 
year, Since the random sampling scheme used did not guarantee Independent observations, it is 
possible that a further improvement mip[ht be achieved by a systematic sample with a suitable 
spacing, 

OBJECTIVE OF STUDY 

The objective of this paper is to examine what influence autocorrelation has upon the re- 
searcher’s decision concerning the presence of distinguishable populations and his ability to assign 
explanatory power to sources of variation. When the researcher makes these decisions based upon 
the behavior of functions of samples (statistics) from the respective populations, the problem is 
recast as examination of the relationship between the degree of autocorrelation and the decision 
statistic, 

This work arose out of ongoing research in the utility of remote sensing for geobotanical 
exploration. In this area of geobotany, the explorationist is trying to recognize differences in the 
spectrum of vegetation growing in soils containing anomalously high concentrations of stragetic 
metals (e.g. copper, lead, zinc, molybdenum, and chromium) as compared to vegetation growing 
in soils containing background concentrations of trace metals. This search is performed with the 
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hope that the soil is derived from a nearby ore body, If such a difference could be detected in 
vegetation spectra, explorationists could use this difference to *’map” promising areas, 

THE STUDY REGION THE COTTER BASIN GEOBOTA^ ' |iCAL TEST SITE 

The Cotter Basin geobotanical test site Is located in Lew;s and Clark County, Montana and 
can be found on the Stonewall Mountain Quadrangle (see Figure 1). The area is In the Northern 
Rocky Mountains Region and possesses considerable relief, The test site region has been studied 
previously by Collins et ul. (1977) and Collins et al, (1980), The remotely sensed data used In their 
research were acquired by a prototype high resolution (500 channels from 0.400 }mx\ to 1,0 Mm) 
airborne spectroradiometer system (Chiu and Collins, 1978) during one overnight in 1976 and a 
second during 1978, 

Mineralization at the Cotter Basin test site oceurs in westerly striking shear zones In Precam= 
brian sedimentary rocks (Collins et al., 1980). The mineralization is primarily sulfides of copper 
and lead with ancillary high concentrations of zinc, silver and gold. Soil concentrations along shear 
zones are as high as 10,000 parts per million (ppm) copper, 1,500 ppm lead and 400 ppm zinc 
(Collins etal„ 1980), 

Description of Data Used in the Present Study 

The data used were acquired using the Landsat Thematic Mapper Simulator (TMS)-NSOOl- 
(Richard et al., 1978) mounted aboard a NASA C130 aircraft. The data came from flight line 
number 3, flown on August 29, 1979 commencing at 1830 GMT (12 j 30 p.m, MDT), The flight line 
was flown in a south-to-north direction at about 3.0 km (10,000 feet) above ground level. Since the 
instantaneous field-of-view of the NSOOl is 2.5 milliradians, the pixels are approximately 7.5 m (25 
feet) on a side at nadir. Table 1 gives the spectral bands for which data were collected. 

Training Site Selection 

For the purposes of this study two classes were defined: anomalous and background. The 
training site for the anomalous class was located- so that it fell within the intersection of the soil and 
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Table I 

TMS (NSOO I ) Spectral Bands 


Channel Number 

Band Width 
(Micrometers) 

1 

0.42 

0,52 

2 

0.52- 

0.60 

3 

0.63- 

0.69 

4 

0.76 - 

0.90 

5 

1.00- 

1.30 

6 

1.55- 

1.75 

7 

2.08- 

2,35 

8 

10.40- 

12.50 


remote sensing anomalies defined by Collins et al. (1980), The location of the anomalous training 
site was also constrained by the selection of two background sites such that all three sites were 
located on the same slope and aspect and at the same angle off-nadir with respect to the MSOOl 
flight line. The three training sites were composed of squares of contiguous pixels, 10 pixels on a 
side (100 pixels per training site). 

Initial Comparisons of Training Sites 

Table 2 contains the sample means by training site for each of the eight bands. The results 
of tests of the hypotheses of mean vector equality for combinations of the anomalous and back- 
ground sites is given in Table 3. We clearly reject the null hypothesis not only for comparisons of 
anomalous versus background sites, but also for the comparison of the two backgrounds. Thus all 
three sites are very significantly different from one another. Further examination of the differences 
in mean reflectance was accomplished by looking at univariate (by band) comparisons. Table 4 
gives the results from 32 univariate tests (8 bands X 4 combinations of training sites). The F ratios 
formed were highly significant (a « 0.006) for 30 of the 32 tests, indicating that the significance 
found in the multivariate tests was not attributable to some systematic subset of the spectral bands. 

Thus, we find significant differences among all three sites for virtually all spectral bands, that 
is both for bands containing vegetative information as well as for bands which are insensitive to the 
presence of vegetation. Therefore, geobotanical sources of variation are unlikely to account for 

such pervasive significance. In looking for explanatory effects, we start with the testing procedure. 
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Sample means for the eight TMS bands for each test site, Each mean is computed 

from 100 contiguous pixels. 


Band 

Center 

X(Mm) 


Site 


A 


B2 

1 

0.48 

88.5 

116.8 

115.6 

2 

0.56 

57.1 

68.4 

92.7 

3 

0.66 

28.8 

32.4 

51.3 

4 

0.83 

66.2 

68.3 

108.7 

5 

1.15 

66.3 

73.9 

124.6 

6 

1.65 

32.9 

38.0 

77.5 

7 

2.2 

52.0 

62.5 

87.0 

8 

11.4 

35.9 

47.4 

77.2 


A - anomalous site 
B i - background site I 
- background site 2 


Table 3 

Results from tests of null hypotheses of mean vector equality, Is the observed 
Wilks' Lambda value, G is the related value of the F statistic, Fq is the preselect 
critical value of ct * 0.0 1, and P (F > G) is the probability that an F random variable 
with the appropriate degrees of freedom would be greater than G under the 

null hypotheses. 


Comparison 

\bs 

G 

^'0,01 

P (F > G) 

A vs Bj 

0.23 

80.89 

-2.51 

3.03 X 10“^® 

A vs B 2 

0,13 

165.37 

-2.51 

1.42 X 10”®^ 

B, VSB 2 

0.27 

65.06 

-2.51 

1.75 X 10-^0 

All 

0.08 

92.76 

-2.04 

6.79 X 10"'78 


A — anomalous sfie 
~ background site I 
B 2 - background site 2 
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Table 4 

Results from univariate tests of mean differences in reflectance for 
combinations of training sites. 




Contrast^ 


All 

A vsB| 

A vs Bj 

Bj vs B 2 

Band 1 

161.23 

259.52 

420.33 

0.36 


* 

* 


NS 

Band 2 

308.11 

87.06 

658.67 

193.54 


4i 

* 

$ 

Id 

Band 3 

278.58 

106.62 

338.96 

233.39 


* 

># 



Band 4 

162.77 

1.28 

192.55 

187,64 


Ip 

NS 

♦ 

Id 

Band 5 

211.33 

18.53 

265.20 

204.63 


* 

* 


Id 

Band 6 

265.54 

42.88 

312.72 

240.00 


# 

<¥ 

* 

Id 

Band 7 

488.63 

198.71 

745.78 

340.56 



* 

♦ 

Id 

Band 8 

406.02 

297.73 

559.91 

275.51 


« 


* 

* 


^Test statistic*! is an F-ratio with, degrees of freedom for: two sites - I, 198; 

three sites ~ 2, 297. 

♦Test significant at a ■ 0.05/8 « 0.006 (Bonferroni adjustment for each contrast, (Neter and 
Wasscrman, 1974)). 

NS - Test not significant at a » 0.006. 

A ~ anomalous site 
B j -* background site 1 
B 2 — background site 2 
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The test procedure used is called multivariate analysis of variance (MANOVA) and procedes In 
the following manner: 


1 . Postulate a model 

Vt) ■ «j + fy (1) 

(t - 1,2 lOOi J - 1,2,3) 

where 

Ytj is an 8 X I random vector of spectral observations of the t* pixel from the site, dis^ 
tributed as a multivariate normal vector deviate with mean equal to and the covariance 
given in an 8 X 8 matrix whose diagonal elements are the variances of the spectral bands and 
whose off diagonal elements arc covariances, This is symbolized as 


jij is an 8 X I vector of means for the site; 

Is an 8 X 1 random vector MN (Q, ?j) 

?j, 1 » k and m ■ n 
0gx8> l=s^k or m#n;m,n* 1,2,3 


wicil 


msi 


m Skn^ 


( 2 ) 

(3) 


This last assumption arose from the additional assumption that the error for one pixel is un- 
correlated with the error for any other pixel. Thus for I k at one site, 

ElSimSkml ’ Etj,„) -2-0^ 

This property is called stochastic independence and is more commonly written as 

EKiin- ’ °8x8> W 

and (4) results when we recall ix, « 0. Since the gj-s’s are the random portion of the model, the 
same stochastic independence assumption applies to the Yy’s. The four modeling assumptions (1) 
to (4) will be collectively known as 12 in further discussion. 

2. Collect Data and Estimate Parameters 

The observed data used has previously been described. The estimates of Mj and will be 
denoted as Yj and Sj respectively and 
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?j ■ I/'OO f Y,J 

Sj - 1/92 f (Y,j-?jKY,j-?j)^. 

Two othir quantities are also estimated from the daH* Tliese are the wlthln-jroups (sites) sums of 

squares and cross products (WSSCP) matrix, W with the estimate, 

^ - I 92 Sj 
j-l ) 

and the total sums of squares and cross products (TSSCP) matrix T, estimated by 

f, (Yy-Y)(Y^-Y)f 

where Y ** 1/3 

3. Test Statistics 

The SSCP associated with W is calculated about the individual test site means, whiie the SSCP 
associated with T used the grand mean over all pixels as a collection factor. Clearly if any of the 
different, the deviation vectors (Yj^ of the pixels in this population will be smaller 
?i.4n ihe deviation vectors formed by the difference of the same pixels and the ^^and mean u * 
i jUj. This observation is the basis for the test statistic used. Tlie determinant of a square n x n 
matrix A, written i A|, , can be thought of as a measurement of the volume in n dimensional space 
of the hyperdimensional solid whose vertices are the columns of the matrix A. TI)e relation be- 
tween the correction factor used in the deviation vector (ajj or ju) and the determinant function is 
that the volume of the matrix gets smaller if ^ for at least one j. The maximum value |W| can 
take on then is when in this case W « T and iW| «*■ |T|. This motivates the test statistics 
which is attributed to Wilks, 1932, and is known as the Wilks-Lambda or A, where 

IT! ' 

Since we only have samples from the population, A is estimated by A with A = |W1 / ITl. Since 
A is a function of it is Itself a statistic and under H and the null hypothesis, n Hg) a func- 
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tion of A is distributed approximately as an statistic or in special cases an F statistic^ with 
degrees of freedom dependent upon the number in the sample, the number of groups (sites) and 
the number of properties (bands). To the calculated value of the statistic A» (an arbitrary but 
fixed real number), is applied the F statistic transformation which yields the value G. After 
choosing the level of type I error willing to be risked (commonly called the <x level), the researcher 
compares G to a critical value, F^, corresponding to the ot level of the F distribution present under 
n n Hg, the decision rule is then 

G < Fg retain null hypothesis 

G > Fg reject null hypothesis and conclude, at least one ^ ju. 

The hypothesis testing environment is pictured in Figure 2. The purpose of this paper is now recast 
as whether F^, as just constructed yields the desired decision rule when the previously stated 
assumption (4) under ^2 is relaji..4. Do we still have a Pr (type I error) = a when we allow the 
observation vectors to be dependent? 

The presence of a dependence structure in the remote sensing data has been documented by a 
number of researchers whose work was summarized previously. Based on these previous papers we 
will assume that the TMS data follows the autoregression integrated moving average model - 
ARiMA (1,0, 1) - of Box and Jenkins (1970). This model is given by 

Yf = Yj_j -d^ aj_j + a^ [1] 

where: * Yj is the deviation of the gray scale value at pixel t from its mean; 

aj, a^_j are random variables N(0, aj) and cov (a^, a^^j) = 0 [covariance 
between a^ and a^_| ] ; 

<t>^ and are coefficients. 

Under this model we examine assumption (4) as follows: 


^The function of A Is an exact F distribution for the special cases of properties (bands) * 2,3 or groups (training 
sites) * 2,3 (Tatsuoka, 1970). The work herein does fall into one of these special cases. 
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Figure 2. Setting for hypothesis testing in present study. 
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E((Yt-./4)(Yt.i -M)l 

* cov(Yj, Yj.i) (since Yj * Yj -Aty) 

» cov(0j Yj.i -01 +aj, Y^^l) (by[l)) 

* cov(0j Yj.j, Yj^,) + cov(-0j a^^j, Yj^l) + cov(aj, Yj^j) (since CO V is a linear 
operator) 

• a 

* 0 j cov (Yj_i , Yj^l ) “ cov (aj_i , a^.j ) (by independence of a's) 

* « 

■ j var (Y() ~ (by stationarity) 

‘r* 0 (since var (Yj) 

Thus the presence of the ARIMA (I, 0, 1) implies that the observations are independent. 
SETTING UP THE SIMULATION 

Examination of the effect of lack of independence was accomplished through a simulation 
procedure. A simulation approach was adopted because the question we examined concerned the 
adequacy of the central F distribution when the null hypothesis was correct, but the data was de- 
pendent in the fashion described, So the significance of the value of the test statistic yield by the 
training set must be determined by comparison with a frequency distribution occurring under the 
Hg (all class mean vectors are equal), and the suitable lack of independence assumptions. No such 
frequency distribution is tabled. Therefore, the gist of the simulation was to use a Monte Carlo 
approach to generate such a frequency distribution. This was accomplished by calculating G 
values from randomly generated data sets whose means are identical (/Uj = ^2 “ ^3 “ validating 
Hq, and which possessed covariance and autocorrelation structures calculated from the Cotter 
Basin test sites. To provide a control, these frequency distributions were then compared with 
frequency distributions generated from data with the same mean and covariance structure but not 
possessing the ARIMA (I, 0, I) relationship. To create this simulation a set of assumptions was 
needed. These assumptions dealt with the model and error structure model. 

Assumptions 

The simulation used the following set of assumptions; 


QUAUTY 
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1, The first 3 assumptions under fZ, with assumption (1) reduced to Xj ® /f + NqI 

2, Each band is adetjuately represented by an univariate ARIMA (1,0, 1 ) model; a further 
implication of this assumption is that the sequences, formed by scan lines, are stationary 
in the mean and variance, i.e. possess “wide sense” stationarity; 

3, The transfer function between bands has a zero value for the delay parameter. This means 
that any correlation across bands is in phase, or equivalently if ^ is related to Y^j j.j , it 
can be stated as the cross correlation between Yj„( and the ARIMA model relating Y^j^ 
and Yjj , where n and m are bands. 

The Variance-Covariance Matrix of Independent Observations 

The system of ARIMA equations are parameterized by the 's, 0 j ’s and the variances of a^, 
a NID deviate vector. Under the assumptions for the simulation we obtained an estimate of this 
variance-covariance matrix of the independent random vectors as follows. 

Let Yjj and Yj^ be the values of the gray scale for the bands i and j at pixel t of a scan line for 
the same site, then 

Yit " «il Yit-i - ai,., + aj, 

[2] 

'jt = “jt-i 

(By assumption of ARIMA (1,0, 1 ) model). 

Using 'Equations [2} 

cov(Yjt,Vjt) = cov«i, Yj,., aj,., +ai(, Yj,., -9j, aj,., taj,) 

• • 

(Expanding) = cov(^U Yj^.j, Yj^.j) 

+ COV + Wil “jt) 

+ cov(-9ii a;,.,, <l,ji Yj,.,) + covf«i, ai,.,. -Sj, aj,.,) 

+ cov(-»i, aj(.,,aj,) + cov(ai,,«j., 1^.,.,) 

+ cov(ajt, ajj.i) + cov(ajj, ajt) 

(By stationarity and coincident cross band relationships) 
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" 0il 0jl cov(Yjt, 0ji ay + O-0ji0jiajj 

+ djU ay +0 + 0 +0 + cry 


where ay is the covariance of bands i and j based on the independent random vectors. Tlierefore; 


"ij 


cov 


(Y|t. Yjt) 


( 3 ) 


1 + ay0ji 0jj 0j| 

and ay is the estimate of ay formed by estimating all the quantities on the right-hand side of [3] 


Estimates of (f> and d 

The coefficients of the ARIMA (1, 0, 1) model, and were estimated for each of the sites. 
For each test site and for the eight spectral barids, three scan lines were chosen with each scan line 
composed of 100 pixels incorporating the test sites and the surrounding region in the east-west 
direction. The (f>^ and coefficients were estimated using a maximum likelihood procedure as 
implemented in the program by Pack et al. (1972). The value of did not vary significantly over 
bands within one test site. However 6 j did exhibit significant variation with spectral band over a 
test site and so we decided to run the simulation three times Using the maximum, median and 
minimum value of as these coincided with the range of values of found at each site. The 
values of and used are given in Table 5. 

SIMULATION PROCEDURE 

Figure 3 displays the steps in the simulation procedure. The first two steps were previously 
described and are the estimation of descriptive statistics such as the mean vectors, variance- 
covariance matrix and ARIMA coefficients, some of which were inputs at later points in the simu- 
lation. The variance-covariance matrices were then adjusted for the presence of autocorrelation as 
previously described. This yielded a new set of variance-covariance matrices, Then using the 
subroutine GGNSM from the International Mathematical and Statistical Libraries (IMSL, 1979) 
three sets (one for each site) of 150 random vectors were generated so that the individual vectors 
were stochastically independent and followed a MN (0 ?j*) distribution. The appropriate ARIMA 
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CALCULATE 1 FOR EACH TEST 
SITE FROM GENERATED DATA 


COMPUTE WITHIN AND TOTAL 
DISPERSION MATRICES 


CALCUUTE DETERMINANTS 
OF WITHIN AND TOTAL DISPERSION 


COMPUTE WILKS LAMSDA 
CRITERION (A) AND F VALUE 


STOP 


Figure 3. Flow in simulation procedure. 
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Table 5 

Values of and used In simulation. 



Bj - background site 1 
B 2 “ background site 2 

* 

models for each site was then added to the vectors, This was accomplished by letting Sj = Yj and 
then using the recursive relationship formed by the ARIMA (1, 0, 1) model. The first 50 vectors 

A A 

were discarded to protect against transient or start up effects. From these three data sets, W, T 
and their determinants were calculated (using the LINV2P subroutine of IMSL), followed by 
AqIjj and G. This procedure was repeated 1000 times for each set of and ^ j used and yielded 
a frequency distribution of G. Using the same random seed string, 1000 values of G were calcu- 
lated for each of the following; 

1. Maximum Autocorrelation 

2. Median Autocorrelation 

3. Minimum Autocorrelation 

4. No Autocorrelation^ 


"The ‘no autocorrelation’ iteration of the process was performed using both “adjusted” and “nonadjusted” co- 
variance matrices, The results were identical. 
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RESULTS 

Table 6 gives the critical or 95 and 98,75 percentile^ values of the frequency distribution 
formed from the four iterations for the four comparisons (three paired site comparisons and one 
comparison of all three sites) plus the corresponding observed values which were calculated using 
the training sets selected for Cotter Basin, The same information for the comparison of all three 
sites is shown in Figure 4, which is the generated frequency distributions of G. The critical values 
for the generated data were determined from the observed cumulative frequency distribution, 
such that the desired proportion (either .05 or 0.0125) of the observations were greater than 
the critical vak?. 


*■’ K ‘ t 
g jf". ' 

QUAUTY 


Table 6 

Critical percentile values from generated frequency distributions versus G observed at Cotter Basin. 



Observed 

Values 

Simulated Values 

Degree of Autoccrreiation 





None 


Most 


Median 


Least 

Comparison 


i 

0.05 0.0125 

0.05 0.0125 

0.05 

0.0125 

0,05 

0.0125 

All 

92.8 

2.50 3.05 

216 

328 

102.5 

140,0 

28.0 

35,2 

A,Bi 

80.9 

2.00 2.80 

140 

196 

75.2 

105.6 

13.5 

20.4 


65.0 

2.00 2.68 

107.5 

157.5 

67,2 

86.4 

22,0 

30.5 

A, B 2 

165.4 

2.04 2.68 

122.5 

162.5 

60.0 

79.2 

25.2 

33.2 


A - anomalous site 
B| ~ background site 1 
B 2 - background site 2 


The critical values from the iteration with no autocorrelation are slightly greater for all four com- 
parisons than they would be under i2 O (compare Tables 3 and 6). We attribute this to the 
unequal variance-covariance matrices from the three sites. The inflation of the critical value under 
the other three iterations in which autocorrelation is present is considerable. In three of the four 


^The 98. 7S percentile correspponds to an a « 0.0 12S significance level representing a Bonferroni level adjusted to 
yield an overali a level of 0.05 for the four comparisons within a given iteration of the simulation. 


19 






PAQC IS 

quality 


* i 



9 ^ 

30V1N33«3^ 3DV1N3DM3^ 


20 


ORIGINAL 
OF POOR 


1 


Figure 4. Frequency distribution of G values from generated data witli ditieiine degrees of autocorrebtion. all three sites 
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comparisons the observed value is less than the critical value of the frequency distribution generated 
from the “maximally’’ autocorrelated data. This implies that If observations were autocorrelated to 
the maximum degree, autocorrelation and not mean differences could account for the variation 
seen in the observed data (recall that the mean vectors of the generated data sets are not significantly 
different from 0). The fourth comparison, that between the anomalous and second background 
site, is just significant at the 0.0125 probability level and this result suggests a need for further 
examination. The important point here is that even for this comparison, the observed value of G as 
a test of mean differences is not, as it first appeared to be, so overwhelmingly significant when 
compared to the critical values derived from the simulation. 

A similar result held true for the first three comparisons under the median iteration, that is 
none of the comparisons were significant at an a « 0,0125 level. Only under the minimum Iteration 
did all four comparisons appear significant at the a » 0.0125 level. Even for this iteration the 
critical values were inflated by at least time times. 

The relationship between the simulated data possessing varying degrees of autocorrelation and 
the uncorrelated data is summarized in Table 7. For each comparison, 50 sets of values of G were 
randomly selected. Each set contained four values of G, one from each iteration; all values of G 
in a set were based on the same random number seed. Figure 5 uses the data generated from the 
comparison of all three sites to display the values of G from the three autocorrelated iterations 
plotted individually against their corresponding non-autocorrelated values, The inflation in the 
value of G, as was indicated earlier, increases with the degree of autocorrelation, The slope 
(Table 7), as the inflation multiplier, varied between 7 and 70, Clearly r^ must increase as the 
degree of autocorrelation is decreased since G based on autocorrelation data approaches the value 
ofG based on non-autocorrelated data. This is an outcome of the common generation procedure. 
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Figure 5. Plots of G values from autocorrelaied observations versus G values from non-autoconelated observations from all ates. 
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Table 7 

Hdatlonship between F statistics calculated from observations possessing differing degree of 
autocorrelation. Abscissa F values are calculated from corresponding non-autocorrelated 

data sets. 


Comparison 

y 

Slope 

Intercept 

R2 


Most 

67.85 

2.46 

0.401 

All 

Median 

30,24 

6.39 

0.418 

Least 

11.00 

-0.81 

0.783 


Most 

47.18 

12.94 

0,397 

A,Bj 

Median 

28.15 

4,10 

0.534 

Least 

6.42 

0.22 

0,820 


Most 

36.23 

13.88 

0.374 

Bi.Bj 

Median 

20.72 

9.12 

0.360 

Least 

9.80 

0.72 

0.757 


Most 

39.76 

15.29 

0.391 

A, B, 

Median 

18.64 

8.82 

0.374 


Least 

10.31 

1.78 

0,692 


A - anomalous site 
Bj -- background site I 
B -7 - background site 2 


SUMMARY AND CONCLUSIONS 

For a geobotanical test site at Cotter Basin, Montana, we have examined the MANOVA hypo- 
thesis testing situation for differences in the mean vectors, replacing the assumption of independ- 
ence among the pixels (random vectors) In the training set by the assumption of an ARIMA (1,0, I) 
model for sequence of pixels. The examination was accomplished by a simulation which results in 
frequency distributions of variables which should be distributed as F under the null hypothesis that 
the mean vectors are equal, 

Using TMS data acquired over Cotter Basin and previous research In this area, we selected test 
sites from a geobotanical anomaly and two suitably constrained background area. From Initial 
comparisons of the mean vectors, we concluded that the test sites were very significantly different 
from one another based on reflectance and emittance in the eight TMS bands, Using estimates of 
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the covariances and autocorrelation developed from the test sites, data sets with mean vectors 
equal to zero were generated, The generated data was then manipulated to form the test statistics. 
Comparison of the frequency distributions of these test statistics with the Initial result led us to 
conclude that the variation in mean reflectance from the test sites could be explained by the 
presence of autocorrelation In the data and not differences related to the presence of a geobotanical 
anomaly. From this result and those of other researchers, we believe It Is a grevlous, Indeed fatal 
error, to use contiguous pixels as the training or testing set with most conventional statistical 
procedures; and conclusions drawn from analyses which do not observe this constraint should be 
treated circumspectly. This claim Includes both parametric and nonparametric procedures. If 
procedures requiring indpendent observations are needed then a grid sample (often with a spacing 
> 10 pixels (Craig, 1978: Labovitz and Masuoka, 1981)1 or a suitable filtering algorithm should 
be used to remove the autocorrelation, 
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